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RAW SEQUENCE LISTING DATE: 04/26/2001 

PATENT APPLICATION: US/09/769,066 TIME: 17:06:08 

Input Set : N:\Crf3\RULE60\09769066.txt 
Output Set: N:\CRF3\04262001\I769066.raw 

SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: Fuerst, Thomas R. 

6 McAtee, C. Patrick 

7 Yarbough, Patrice O. 

8 Zhang, Yif an 

10 (ii) TITLE OF INVENTION: HEPATITIS E VIRUS ANTIGENS AND USES THEREFOR 

12 (iii) NUMBER OF SEQUENCES: 31 

14 (iv) CORRESPONDENCE ADDRESS: 

15 (A) ADDRESSEE: Dehlinger & Associates 

16 (B) STREET: 350 Cambridge Ave., Suite 250 

17 (C) CITY: Palo Alto 

18 (D) STATE: CA 

19 (E) COUNTRY: USA 

20 (F) ZIP: 94306 

22 (V) COMPUTER READABLE FORM: 

23 (A) MEDIUM TYPE: Floppy disk 

24 (B) COMPUTER: IBM PC compatible 

25 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

26 (D) SOFTWARE: Patentln Release #1.0, Version #1.25 
28 (vi) CURRENT APPLICATION DATA: 

C--> 29 (A) APPLICATION NUMBER: US/09/769,066 

C--> 30 (B) FILING DATE: 24-Jan-2001 

31 (C) CLASSIFICATION: 

3 3 (vii) PRIOR APPLICATION DATA: 

34 (A) APPLICATION NUMBER: 08/542,634 

35 (B) FILING DATE: 

38 (viii) ATTORNEY/AGENT INFORMATION: 

3 9 (A) NAME: Fabian, Gary R. 

40 (B). REGISTRATION NUMBER: 33,875 

41 (C) REFERENCE/DOCKET NUMBER: 4600-0293.30 

4 3 (ix) TELECOMMUNICATION INFORMATION: 

44 (A) TELEPHONE: (415) 324-0880 

45 (B) TELEFAX: (415) 324-0960 
47 (2) INFORMATION FOR SEQ ID NO : 1: 

4 9 (i) SEQUENCE CHARACTERISTICS: 

50 (A) LENGTH: 2049 base pairs 

51 (B) TYPE: nucleic acid 

W--> 60 (C) STRANDEDNESS: Hepatitis E Virus (Burma strain) 

61 ORF-2 

53 (D) TOPOLOGY: linear 

55 (ii) MOLECULE TYPE: DNA (genomic) 

57 (iii) HYPOTHETICAL: NO 

59 (vi) ORIGINAL SOURCE: 

64 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

66 ATGCGCCCTC GGCCTATTTT GTTGCTGCTC CTCATGTTTT TGCCTATGCT GCCCGCGCCA 60 
68 CCGCCCGGTC AGCCGTCTGG CCGCCGTCGT GGGCGGCGCA GCGGCGGTTC CGGCGGTGGT 120 
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PATENT APPLICATION: US/09/769,066 TIME: 17:06:08 
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70 TTCTGGGGTG ACCGGGTTGA TTCTCAGCCC TTCGCAATCC CCTATATTCA TCCAACCAAC 180 

72 CCCTTCGCCC CCGATGTCAC CGCTGCGGCC GGGGCTGGAC CTCGTGTTCG CCAACCCGCC 240 

74 CGACCACTCG GCTCCGCTTG GCGTGACCAG GCCCAGCGCC CCGCCGTTGC CTCACGTCGT 300 

76 AGACCTACCA CAGCTGGGGC CGCGCCGCTA ACCGCGGTCG CTCCGGCCCA TGACACCCCG 360 

78 CCAGTGCCTG ATGTCGACTC CCGCGGCGCC ATCTTGCGCC GGCAGTATAA CCTATCAACA 420 

80 TCTCCCCTTA CCTCTTCCGT GGCCACCGGC ACTAACCTGG TTCTTTATGC CGCCCCTCTT 480 

82 AGTCCGCTTT TACCCCTTCA GGACGGCACC AATACCCATA TAATGGCCAC GGAAGCTTCT 540 

84 AATTATGCCC AGTACCGGGT TGCCCGTGCC ACAATCCGTT ACCGCCCGCT GGTCCCCAAT 600 

86 GCTGTCGGCG GTTACGCCAT CTCCATCTCA TTCTGGCCAC AGACCACCAC CACCCCGACG 660 

88 TCCGTTGATA TGAATTCAAT AACCTCGACG GATGTTCGTA TTTTAGTCCA GCCCGGCATA 720 

90 GCCTCTGAGC TTGTGATCCC AAGTGAGCGC CTACACTATC GTAACCAAGG CTGGCGCTCC 780 

92 GTCGAGACCT CTGGGGTGGC TGAGGAGGAG GCTACCTCTG GTCTTGTTAT GCTTTGCATA 84 0 

94 CATGGCTCAC TCGTAAATTC CTATACTAAT ACACCCTATA CCGGTGCCCT CGGGCTGTTG 900 

96 GACTTTGCCC TTGAGCTTGA GTTTCGCAAC CTTACCCCCG GTAACACCAA TACGCGGGTC 960 
98 TCCCGTTATT CCAGCACTGC TCGCCACCGC CTTCGTCGCG GTGCGGACGG GACTGCCGAG 1020 

100 CTCACCACCA CGGCTGCTAC CCGCTTTATG AAGGACCTCT ATTTTACTAG TACTAATGGT 1080 

102 GTCGGTGAGA TCGGCCGCGG GATAGCCCTC ACCCTGTTCA ACCTTGCTGA CACTCTGCTT 1140 

104 GGCGGCCTGC CGACAGAATT GATTTCGTCG GCTGGTGGCC AGCTGTTCTA CTCCCGTCCC 1200 

106 GTTGTCTCAG CCAATGGCGA GCCGACTGTT AAGTTGTATA CATCTGTAGA GAATGCTCAG 12 60 

108 CAGGATAAGG GTATTGCAAT CCCGCATGAC ATTGACCTCG GAGAATCTCG TGTGGTTATT 1320 

110 CAGGATTATG ATAACCAACA TGAACAAGAT CGGCCGACGC. CTTCTCCAGC CCCATCGCGC 13 80 

112 CCTTTCTCTG TCCTTCGAGC TAATGATGTG CTTTGGCTCT CTCTCACCGC TGCCGAGTAT 1440 

114 GACCAGTCCA CTTATGGCTC TTCGACTGGC CCAGTTTATG TTTCTGACTC TGTGACCTTG 1500 

116 GTTAATGTTG CGACCGGCGC GCAGGCCGTT GCCCGGTCGC TCGATTGGAC CAAGGTCACA 1560 

118 CTTGACGGTC GCCCCCTCTC CACCATCCAG CAGTACTCGA AGACCTTCTT TGTCCTGCCG 1620 

120 CTCCGCGGTA AGCTCTCTTT CTGGGAGGCA GGCACAACTA AAGCCGGGTA CCCTTATAAT 1680 

122 TATAACACCA CTGCTAGCGA CCAACTGCTT GTCGAGAATG CCGCCGGGCA CCGGGTCGCT 174 0 

124 ATTTCCACTT ACACCACTAG CCTGGGTGCT GGTCCCGTCT CCATTTCTGC GGTTGCCGTT 1800 

126 TTAGCCCCCC ACTCTGCGCT AGCATTGCTT GAGGATACCT TGGACTACCC TGCCCGCGCC 1860 

128 CATACTTTTG ATGATTTCTG CCCAGAGTGC CGCCCCCTTG GCCTTCAGGG CTGCGCTTTC 1920 

130 CAGTCTACTG TCGCTGAGCT TCAGCGCCTT AAGATGAAGG TGGGTAAAAC TCGGGAGTTG 1980 

132 TAGTTTATTT GCTTGTGCCC CCCTTCTTTC TGTTGCTTAT TTCTCATTTC TGCGTTCCGC 2040 

134 GCTCCCTGA 2049 

13 6 (2) INFORMATION FOR SEQ ID NO: 2: 

138 (i) SEQUENCE CHARACTERISTICS: 

139 (A) LENGTH: 2058 base pairs 

140 (B) TYPE: nucleic acid 

W--> 149 (C) STRANDEDNESS: Hepatitis E Virus (Mexico Strain) 

150 ORF-2 region 

142 (D) TOPOLOGY: linear 

144 (ii) MOLECULE TYPE: DNA (genomic) 

146 (iii) HYPOTHETICAL: NO 

14 8 (vi) ORIGINAL SOURCE: 

152 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

154 ATGCGCCCTA GGCCTCTTTT GCTGTTGTTC CTCTTGTTTC TGCCTATGTT GCCCGCGCCA 60 

156 CCGACCGGTC AGCCGTCTGG CCGCCGTCGT GGGCGGCGCA GCGGCGGTAC CGGCGGTGGT 120 

158 TTCTGGGGTG ACCGGGTTGA TTCTCAGCCC TTCGCAATCC CCTATATTCA TCCAACCAAC 180 

160 CCCTTTGCCC CAGACGTTGC CGCTGCGTCC GGGTCTGGAC CTCGCCTTCG CCAACCAGCC 24 0 

162 CGGCCACTTG GCTCCACTTG GCGAGATCAG GCCCAGCGCC CCTCCGCTGC CTCCCGTCGC 300 
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PATENT APPLICATION: US/09/769,066 TIME: 17:06:08 
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Output Set: N:\CRF3\04262001\I769066.raw 

164 CGACCTGCCA CAGCCGGGGC TGCGGCGCTG ACGGCTGTGG CGCCTGCCCA TGACACCTCA 360 

166 CCCGTCCCGG ACGTTGATTC TCGCGGTGCA ATTCTACGCC GCCAGTATAA TTTGTCTACT 4 20 

168 TCACCCCTGA CATCCTCTGT GGCCTCTGGC ACTAATTTAG TCCTGTATGC AGCCCCCCTT 4 80 

170 AATCCGCCTC TGCCGCTGCA GGACGGTACT AATACTCACA TTATGGCCAC AGAGGCCTCC 540 

172 AATTATGCAC AGTACCGGGT TGCCCGCGCT ACTATCCGTT ACCGGCCCCT AGTGCCTAAT 600 

174 GCAGTTGGAG GCTATGCTAT ATCCATTTCT TTCTGGCCTC AAACAACCAC AACCCCTACA 660 

176 TCTGTTGACA TGAATTCCAT TACTTCCACT GATGTCAGGA TTCTTGTTCA ACCTGGCATA 720 

178 GCATCTGAAT TGGTCATCCC AAGCGAGCGC CTTCACTACC GCAATCAAGG TTGGCGCTCG 7 80 

180 GTTGAGACAT CTGGTGTTGC TGAGGAGGAA GCCACCTCCG GTCTTGTCAT GTTATGCATA 840 

182 CATGGCTCTC CAGTTAACTC CTATACCAAT ACCCCTTATA CCGGTGCCCT TGGCTTACTG 900 

184 GACTTTGCCT TAGAGCTTGA GTTTCGCAAT CTCACCACCT GTAACACCAA TACACGTGTG 960 

186 TCCCGTTACT CCAGCACGGC CCGTCACCGG CTCCGCCGAG GGGCCGACGG GACTGCGGAG 1020 

188 CTGACCACAA CTGCAGCCAC CAGGTTCATG AAAGATCTCC ACTTTACCGG CCTTAATGGG 1080 

190 GTAGGTGAAG TCGGCCGCGG GATAGCTCTA ACATTACTTA ACCTTGCTGA CACGCTCCTC 1140 

192 GGCGGGCTCC CGACAGAATT AATTTCGTCG GCTGGCGGGC AACTGTTTTA TTCCCGCCCG 1200 

194 GTTGTCTCAG CCAATGGCGA GCCAACCGTG AAGCTCTATA CATCAGTGGA GAATGCTCAG 1260 

196 CAGGATAAGG GTGTTGCTAT CCCCCACGAT ATCGATCTTG GTGATTCGCG TGTGGTCATT 1320 

198 CAGGATTATG ACAACCAGCA TGAGCAGGAT CGGCCCACCC CGTCGCCTGC GCCATCTCGG 13 80 

200 CCTTTTTCTG TTCTCCGAGC AAATGATGTA CTTTGGCTGT CCCTCACTGC AGCCGAGTAT 1440 

202 GACCAGTCCA CTTACGGGTC GTCAACTGGC CCGGTTTATA TCTCGGACAG CGTGACTTTG 1500 

204 GTGAATGTTG CGACTGGCGC GCAGGCCGTA GCCCGATCGC TTGACTGGTC CAAAGTCACC 1560 

206 CTCGACGGGC GGCCCCTCCC GACTGTTGAG CAATATTCCA AGACATTCTT TGTGCTCCCC 1620 

208 CTTCGTGGCA AGCTCTCCTT TTGGGAGGCC GGCACAACAA AAGCAGGTTA TCCTTATAAT 1680 

210 TATAATACTA CTGCTAGTGA CCAGATTCTG ATTGAAAATG CTGCCGGCCA TCGGGTCGCC 1740 

212 ATTTCAACCT ATACCACCAG GCTTGGGGCC GGTCCGGTCG CCATTTCTGC GGCCGCGGTT 1800 

214 TTGGCTCCAC GCTCCGCCCT GGCTCTGCTG GAGGATACTT TTGATTATCC GGGGCGGGCG . 1860 

216 CACACATTTG ATGACTTCTG CCCTGAATGC CGCGCTTTAG GCCTCCAGGG TTGTGCTTTC 1920 

218 CAGTCAACTG TCGCTGAGCT CCAGCGCCTT AAAGTTAAGG TGGGTAAAAC TCGGGAGTTG 1980 

220 TAGTTTATTT GGCTGTGCCC ACCTACTTAT ATCTGCTGAT TTCCTTTATT TCCTTTTTCT 204 0 

.222 CGGTCCCGCG CTCCCTGA 2058 
224 (2) INFORMATION FOR SEQ ID NO: 3: 



226 (i) SEQUENCE CHARACTERISTICS : 

227 (A) LENGTH: 1647 base pairs 

228 (B) TYPE: nucleic acid 

229 (C) STRAND EDNESS : double 

230 (D) TOPOLOGY: linear 

232 (ii) MOLECULE TYPE: DNA (genomic) 

234 (iii) HYPOTHETICAL: NO ' 

236 (vi) ORIGINAL SOURCE: 

237 (C) INDIVIDUAL ISOLATE: Hepatitis E virus (Burma) r62kDa, 

238 FIGURE 2 
241 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



243 GCGGTCGCTC CGGCCCATGA CACCCCGCCA GTGCCTGATG TCGACTCCCG CGGCGCCATC 60 

245 TTGCGCCGGC AGTATAACCT ATCAACATCT CCCCTTACCT CTTCCGTGGC CACCGGCACT 120 

247 AACCTGGTTC TTTATGCCGC CCCTCTTAGT CCGCTTTTAC CCCTTCAGGA CGGCACCAAT 180 

249 ACCCATATAA TGGCCACGGA AGCTTCTAAT TATGCCCAGT ACCGGGTTGC CCGTGCCACA 24 0 

251 ATCCGTTACC GCCCGCTGGT CCCCAATGCT GTCGGCGGTT ACGCCATCTC CATCTCATTC 300 

253 TGGCCACAGA CCACCACCAC CCCGACGTCC GTTGATATGA ATTCAATAAC CTCGACGGAT 3 60 

255 GTTCGTATTT TAGTCCAGCC CGGCATAGCC TCTGAGCTTG TGATCCCAAG TGAGCGCCTA 420 
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GGAGGAGGCT 4 80 
TACTAATACA . 540 

TCGCAACCTT 600 

CCACCGCCTT 660 

CTTTATGAAG 720 

AGCCCTCACC 7 80 

TTCGTCGGCT 840 

GACTGTTAAG 900 

GCATGACATT 960 

ACAAGATCGG 1020 

TGATGTGCTT 1080 

GACTGGCCCA 1140 

GGCCGTTGCC 1200 

CATCCAGCAG 1260 

GGAGGCAGGC 1320 

ACTGCTTGTC 1380 

GGGTGCTGGT 1440 

ATTGCTTGAG 1500 

AGAGTGCCGC 1560 

GCGCCTTAAG 1620 

297 ATGAAGGTGG GTAAAACTCG GGAGTTG 164 7 
299 (2) INFORMATION FOR SEQ ID NO : 4: 



301 (i) SEQUENCE CHARACTERISTICS: 

302 (A) LENGTH: 1647 base pairs 

303 (B) TYPE: nucleic acid 

W--> 312 (C) STRANDEDNESS : Hepatitis E virus (Mexico strain) 

313 r62kDa, FIGURE 2 

305 (D) TOPOLOGY: linear 

307 (ii) MOLECULE TYPE: DNA (genomic) 

309 (iii) HYPOTHETICAL: NO 

311 (vi) ORIGINAL SOURCE: 

316 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



317 GCTGTGGCGC CTGCCCATGA CACCTCACCC GTCCCGGACG TTGATTCTCG CGGTGCAATT 60 

319 CTACGCCGCC AGTATAATTT GTCTACTTCA CCCCTGACAT CCTCTGTGGC CTCTGGCACT 120 

3 21 AATTTAGTCC TGTATGCAGC CCCCCTTAAT CCGCCTCTGC CGCTGCAGGA CGGTACTAAT 180 

323 ACTCACATTA TGGCCACAGA GGCCTCCAAT TATGCACAGT ACCGGGTTGC CCGCGCTACT 240 

325 ATCCGTTACC GGCCCCTAGT GCCTAATGCA GTTGGAGGCT ATGCTATATC CATTTCTTTC 300 

327 TGGCCTCAAA CAACCACAAC CCCTACATCT GTTGACATGA ATTCCATTAC TTCCACTGAT 3 60 

3 29 GTCAGGATTC TTGTTCAACC TGGCATAGCA TCTGAATTGG TCATCCCAAG CGAGCGCCTT 420 

331 CACTACCGCA ATCAAGGTTG GCGCTCGGTT GAGACATCTG GTGTTGCTGA GGAGGAAGCC 4 80 

333 ACCTCCGGTC TTGTCATGTT ATGCATACAT GGCTCTCCAG TTAACTCCTA TACCAATACC 54 0 

335 CCTTATACCG GTGCCCTTGG CTTACTGGAC TTTGCCTTAG AGCTTGAGTT TCGCAATCTC 600 

3 37 ACCACCTGTA ACACCAATAC ACGTGTGTCC CGTTACTCCA GCACGGCCCG TCACCGGCTC 660 

339 CGCCGAGGGG CCGACGGGAC TGCGGAGCTG ACCACAACTG CAGCCACCAG GTTCATGAAA 720 

341 GATCTCCACT TTACCGGCCT TAATGGGGTA GGTGAAGTCG GCCGCGGGAT AGCTCTAACA 780 

34 3 TTACTTAACC TTGCTGACAC GCTCCTCGGC GGGCTCCCGA CAGAATTAAT TTCGTCGGCT 840 

345 GGCGGGCAAC TGTTTTATTC CCGCCCGGTT GTCTCAGCCA ATGGCGAGCC AACCGTGAAG 900 

347 CTCTATACAT CAGTGGAGAA TGCTCAGCAG GATAAGGGTG TTGCTATCCC CCACGATATC 960 

349 GATCTTGGTG ATTCGCGTGT GGTCATTCAG GATTATGACA ACCAGCATGA GCAGGATCGG 1020 



257 CACTATCGTA ACCAAGGCTG GCGCTCCGTC GAGACCTCTG GGGTGGCTGA 
259 ACCTCTGGTC TTGTTATGCT TTGCATACAT GGCTCACTCG TAAATTCCTA 
261 CCCTATACCG GTGCCCTCGG GCTGTTGGAC TTTGCCCTTG AGCTTGAGTT 
263 ACCCCCGGTA ACACCAATAC GCGGGTCTCC CGTTATTCCA GCACTGCTCG 
265 CGTCGCGGTG CGGACGGGAC TGCCGAGCTC ACCACCACGG CTGCTACCCG 
267 GACCTCTATT TTACTAGTAC TAATGGTGTC GGTGAGATCG GCCGCGGGAT 
269 CTGTTCAACC TTGCTGACAC TCTGCTTGGC GGCCTGCCGA CAGAATTGAT 
271 GGTGGCCAGC TGTTCTACTC CCGTCCCGTT GTCTCAGCCA ATGGCGAGCC 
273 TTGTATACAT CTGTAGAGAA TGCTCAGCAG GATAAGGGTA TTGCAATCCC 
275 GACCTCGGAG AATCTCGTGT GGTTATTCAG GATTATGATA ACCAACATGA 
277 CCGACGCCTT CTCCAGCCCC ATCGCGCCCT TTCTCTGTCC TTCGAGCTAA 
279 TGGCTCTCTC TCACCGCTGC CGAGTATGAC CAGTCCACTT ATGGCTCTTC 
281 GTTTATGTTT CTGACTCTGT GACCTTGGTT AATGTTGCGA CCGGCGCGCA 
283 CGGTCGCTCG ATTGGACCAA GGTCACACTT GACGGTCGCC CCCTCTCCAC 
285 TACTCGAAGA CCTTCTTTGT CCTGCCGCTC CGCGGTAAGC TCTCTTTCTG 
287 ACAACTAAAG CCGGGTACCC TTATAATTAT AACACCACTG CTAGCGACCA 
289 GAGAATGCCG CCGGGCACCG GGTCGCTATT TCCACTTACA CCACTAGCCT 
291 CCCGTCTCCA TTTCTGCGGT TGCCGTTTTA GCCCCCCACT CTGCGCTAGC 
293 GATACCTTGG ACTACCCTGC CCGCGCCCAT ACTTTTGATG ATTTCTGCCC 
295 CCCCTTGGCC TTCAGGGCTG CGCTTTCCAG TCTACTGTCG CTGAGCTTCA 
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351 CCCACCCCGT CGCCTGCGCC ATCTCGGCCT TTTTCTGTTC TCCGAGCAAA TGATGTACTT 1080 

353 TGGCTGTCCC TCACTGCAGC CGAGTATGAC CAGTCCACTT ACGGGTCGTC AACTGGCCCG 1140 

3 55 GTTTATATCT CGGACAGCGT GACTTTGGTG AATGTTGCGA CTGGCGCGCA GGCCGTAGCC 1200 

357 CGATCGCTTG ACTGGTCCAA AGTCACCCTC GACGGGCGGC CCCTCCCGAC TGTTGAGCAA 1260 

359 TATTCCAAGA CATTCTTTGT GCTCCCCCTT CGTGGCAAGC TCTCCTTTTG GGAGGCCGGC 1320 

361 ACAACAAAAG CAGGTTATCC TTATAATTAT AATACTACTG CTAGTGACCA GATTCTGATT 1380 

3 63 GAAAATGCTG CCGGCCATCG GGTCGCCATT TCAACCTATA CCACCAGGCT TGGGGCCGGT 1440 

3 65 CCGGTCGCCA TTTCTGCGGC CGCGGTTTTG GCTCCACGCT CCGCCCTGGC TCTGCTGGAG 1500 

367 GATACTTTTG ATTATCCGGG GCGGGCGCAC ACATTTGATG ACTTCTGCCC TGAATGCCGC 1560 

369 GCTTTAGGCC TCCAGGGTTG TGCTTTCCAG TCAACTGTCG CTGAGCTCCA GCGCCTTAAA 1620 

371 GTTAAGGTGG GTAAAACTCG GGAGTTG * 1647 

373 (2) INFORMATION FOR SEQ ID NO: 5: 



375 (i) SEQUENCE CHARACTERISTICS: 

376 (A) LENGTH: 984 base pairs 

377 (B) TYPE: nucleic acid 

W--> 386 (C) STRANDEDNESS : Hepatitis E Virus (Burma strain) SG3 

387 region 

379 (D) TOPOLOGY: linear 

3 81 (ii) MOLECULE TYPE: DNA (genomic) 

383 (iii) HYPOTHETICAL: NO 

385 (vi) ORIGINAL SOURCE: 

3 90 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 



3 92 GGTGCGGACG GGACTGCCGA GCTCACCACC ACGGCTGCTA CCCGCTTTAT GAAGGACCTC 60 

3 94 TATTTTACTA GTACTAATGG TGTCGGTGAG ATCGGCCGCG GGATAGCCCT CACCCTGTTC 120 

3 96 AACCTTGCTG ACACTCTGCT TGGCGGCCTG CCGACAGAAT TGATTTCGTC GGCTGGTGGC 180 

3 98 CAGCTGTTCT ACTCCCGTCC CGTTGTCTCA GCCAATGGCG AGCCGACTGT TAAGTTGTAT 240 

4 00 ACATCTGTAG AGAATGCTCA GCAGGATAAG GGTATTGCAA TCCCGCATGA CATTGACCTC 300 
402 GGAGAATCTC GTGTGGTTAT TCAGGATTAT GATAACCAAC ATGAACAAGA TCGGCCGACG 360 
404 CCTTCTCCAG CCCCATCGCG CCCTTTCTCT GTCCTTCGAG CTAATGATGT GCTTTGGCTC 420 
4 06 TCTCTCACCG CTGCCGAGTA TGACCAGTCC ACTTATGGCT CTTCGACTGG CCCAGTTTAT 480 
408 GTTTCTGACT CTGTGACCTT GGTTAATGTT GCGACCGGCG CGCAGGCCGT TGCCCGGTCG 540 
410 CTCGATTGGA CCAAGGTCAC ACTTGACGGT CGCCCCCTCT CCACCATCCA GCAGTACTCG 600 
412 AAGACCTTCT TTGTCCTGCC GCTCCGCGGT AAGCTCTCTT TCTGGGAGGC AGGCACAACT 660 
414 AAAGCCGGGT ACCCTTATAA TTATAACACC ACTGCTAGCG ACCAACTGCT TGTCGAGAAT 720 
416 GCCGCCGGGC ACCGGGTCGC TATTTCCACT TACACCACTA GCCTGGGTGC TGGTCCCGTC 780 
418 TCCATTTCTG CGGTTGCCGT TTTAGCCCCC CACTCTGCGC TAGCATTGCT TGAGGATACC 84 0 
420 TTGGACTACC CTGCCCGCGC CCATACTTTT GATGATTTCT GCCCAGAGTG CCGCCCCCTT 900 
422 GGCCTTCAGG GCTGCGCTTT CCAGTCTACT GTCGCTGAGC TTCAGCGCCT TAAGATGAAG 960 
424 GTGGGTAAAA CTCGGGAGTT GTAG 984 
426 (2) INFORMATION FOR SEQ ID NO: 6: 



4 28 (i) SEQUENCE CHARACTERISTICS: 

429 (A) LENGTH: 984 base pairs 

430 (B) TYPE: nucleic acid 

W--> 439 (C) STRANDEDNESS: Hepatits E Virus (Mexico strain) SG3 

440 region 

432 (D) TOPOLOGY: linear 

4 34 (ii) MOLECULE TYPE: DNA (genomic) 

436 (iii) HYPOTHETICAL: NO 

438 (vi) ORIGINAL SOURCE: 
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Input Set : N:\Crf3\RULE60\09769066.txt 
Output Set: N:\CRF3\04262001\I769066.raw 

Keyword misspelled or invalid format, [(A) APPLICATION NUMBER : ] 
Keyword misspelled or invalid format, [(B) FILING DATE:] 
Keyword misspelled or invalid format, [(C) STRAND EDNESS : ] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS : ] , SeqNo^l 
Keyword misspelled or invalid format, [(C) STRANDEDNESS : ] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS:], SeqNo=2 
Keyword misspelled or invalid format, [(C) STRANDEDNESS : ] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS:], SeqNo=4 
Keyword misspelled or' invalid format, [(C) STRANDEDNESS:] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS:], SeqNo=5 
Keyword misspelled or invalid format, [(C) STRANDEDNESS:] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS : ] , SeqNo=6 
Keyword misspelled or invalid format, [(C) STRANDEDNESS:] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS:], SeqNo=7 
Keyword misspelled or invalid format, [(C) STRANDEDNESS:] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS : ] , SeqNo=8 
Keyword misspelled or invalid format, [(C) STRANDEDNESS : ] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS:], SeqNo=9 
Keyword misspelled or invalid format, [(C) STRANDEDNESS : ] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS:], SeqNo=10 
Keyword misspelled or invalid format, [(C) STRANDEDNESS : ] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS:], SeqNo=ll 
Keyword misspelled or invalid format, [(C) STRANDEDNESS : ] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS:], SeqNo=12 
Keyword misspelled or invalid format, [(C) STRANDEDNESS:] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS:], SeqNo=13 
Keyword misspelled or invalid format, [(C) STRANDEDNESS:] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS:], SeqNo=14 
Keyword misspelled or invalid format, [(C) STRANDEDNESS : ] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS:], SeqNo=15 
Keyword misspelled or invalid format, [(C) STRANDEDNESS:] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS : ] , SeqNo-16 
Keyword misspelled or invalid format, [(C) STRANDEDNESS:] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS:], SeqNo=17 
Keyword misspelled or invalid format, [(C) STRANDEDNESS:] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS:], SeqNo=18 
Keyword misspelled or invalid format, [(C) STRANDEDNESS:] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS:], SeqNo=19 
Keyword misspelled or invalid format, [(C) STRANDEDNESS : ] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS:], SeqNo=20 
Keyword misspelled or invalid format, [(C) STRANDEDNESS:] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS : J , SeqNo=21 
Keyword misspelled or invalid format, [(C) STRANDEDNESS:] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS:], SeqNo=22 
Keyword misspelled or invalid format, [(C) STRANDEDNESS:] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS:], SeqNo=23 
Keyword misspelled or invalid format, [(C) STRANDEDNESS:] 
Invalid value of Alpha Sequence Header Field, [STRANDEDNESS:], SeqNo=24 
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Input Set : N:\Crf3\RULE60\09769066.txt 
Output Set: N:\CRF3\04262001\I769066.raw 
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misspelled or 
value of Alpha 
misspelled or 
value of Alpha 
misspelled or 
value of Alpha 
misspelled or 
value of Alpha 
value of Alpha 
value of Alpha 
or "Xaa" used 
or "Xaa" used 



invalid format, 
Sequence Header 

invalid format. 
Sequence Header 

invalid format, 
Sequence Header 

invalid format, 
Sequence Header 
Sequence Header 
Sequence Header 
for SEQ ID#:31 
for SEQ ID#:31 



[(C) STRANDEDNESS : ] 
Field, [STRANDEDNESS:], SeqNo=25 

[(C) STRANDEDNESS : ] 
Field, [STRANDEDNESS : ] , SeqNo=26 

[(C) STRANDEDNESS:] 
Field, [STRANDEDNESS:], SeqNo=27 

[(C) STRANDEDNESS:] 
Field, [STRANDEDNESS:], SeqNo=28 
Field, [MOLECULE TYPE : ] , SeqNo=29 
Field, [MOLECULE TYPE:], SeqNo=30 
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